In the Claims: 

The claims are as follows: 
1 . (Previously presented) A method for document analysis and retrieval, comprising the 
following steps performed in the order recited: 

transmitting, by a remote host in a first computing system to a web service host in a 
second computing system, a first portion of a document; and 

sequentially transmitting, by the remote host to the web service host, at least one 
additional portion of the document, wherein the first portion and the at least one additional 
portion collectively comprise the entire document, wherein the entire document is adapted to be 
reconstructed and subsequently processed via processing said entire document by the web service 
host, said processing comprising at least one of: 

extracting text from said entire document to configure said text in a text format, if 

said entire document received by said web service host comprises said text in a non-text 

format; 

generating document keys associated with said text from analysis of said text in 
said text format, if said entire document received by said web service host comprises said 
text in said text format, or if said web service host has previously performed said 
extracting such that said text in said text format is available to said web service host; and 

determining, from given categories of a document taxonomy, a set of closest 
categories to the document based on a comparison between the document keys and 
category keys of the given categories, if said entire document received by said web 
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service host comprises said document keys, or if said web service host has previously 
performed said generating such that said document keys are available to said web service 
host. 

2. (Original) The method of claim 1, further comprising prior to the sending step identifying said 
web services host, said identifying comprising: 

executing a Universal Description, Discovery, and Integration (UDDI) search to identify 
one or more web services hosts who can receive said document in chunks and who can perform 
said at least one of said extracting, generating, and stemming; and 

selecting said web services host from said one or more web services hosts. 

3. (Original) The method of claim 1, wherein said transmitting and sequentially transmitting 
comprise respectively transmitting and sequentially transmitting the first portion and the at least 
one additional portion via Internet transmission to said web service host. 

4. (Original) The method of claim 1, wherein said generating comprises: 

generating tokens of said text such that stop words do not appear in said tokens; and 
stemming said tokens to generate said document keys from said tokens. 

5. (Original) The method of claim 1, wherein said processing comprises said extracting, said 
generating, and said determining. 
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6. (Original) The method of claim 1, wherein said processing consists of two of said extracting, 
said generating, and said determining. 

7. (Original) The method of claim 1, wherein said processing comprises said extracting but not 
said generating and not said determining. 

8. (Original) The method of claim 1, wherein said processing comprises said generating but not 
said extracting and not said determining. 

9. (Original) The method of claim 1 5 wherein said processing comprises said determining but not 
said extracting and not said generating. 
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10. (Previously presented) A system for document analysis and retrieval, comprising a first 
computing system that includes a remote host, wherein the remote host is remote relative to a 
web service host in a second computing system, and wherein the remote host is adapted to: 

transmit a first portion of a document to the web service host; and 

sequentially transmit at least one additional portion of the document to the web service 
host, wherein the first portion and the at least one additional portion collectively comprise the 
entire document, wherein the entire document is adapted to be reconstructed and subsequently 
processed via processing said entire document by the web service host, said processing 
comprising at least one of: 

extracting text from said entire document to configure said text in a text format, if 

said entire document received by said web service host comprises said text in a non-text 

format; determine 

generating document keys associated with said text from analysis of said text in 
said text format, if said entire document received by said web service host comprises said 
text in said text format, or if said web service host has previously performed said 
extracting such that said text in said text format is available to said web service host; and 

determining, from given categories of a document taxonomy, a set of closest 
categories to the document based on a comparison between the document keys and 
category keys of the given categories, if said entire document received by said web 
service host comprises said document keys, or if said web service host has previously 
performed said generating such that said document keys are available to said web service 
host. 
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11. (Original) The system of claim 10, wherein the remote host is adapted to identify said web 
services host by: 

executing a Universal Description, Discovery, and Integration (UDDI) search to identify 
one or more web services hosts who can receive said document in chunks and who can perform 
said at least one of said extracting, generating, and determining; and 

selecting said web services host from said one or more web services hosts. 

12. (Original) The system of claim 10, wherein to send transmit and to sequentially transmit 
comprises to respectively transmit and sequentially transmit the first portion and the at least one 
additional portion via Internet transmission to said web service host. 

13. (Original) The system of claim 10, wherein said generating comprises: 

generating tokens of said text such that stop words do not appear in said tokens; and 
stemming said tokens to generate said document keys from said tokens. 

14. (Original) The system of claim 10, wherein said processing comprises said extracting, said 
generating, and said determining. 

15. (Original) The system of claim 10, wherein said processing consists of two of said extracting, 
said generating, and said determining. 



16. (Original) The system of claim 10, wherein said processing comprises said extracting but not 
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said generating and not said determining. 

17. (Original) The system of claim 10, wherein said processing comprises said generating but not 
said extracting and not said determining. 

18. (Original) The system of claim 10, wherein said processing comprises said determining but 
not said extracting and not said generating. 
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19. (Previously presented) A method for document analysis and retrieval, comprising the 
following steps performed in the order recited: 

receiving, by a web service host in a second computing system from a remote host in a 
first computing system, a first portion of a document; 

sequentially receiving, by the web service host from the remote host, at least one 
additional portion of the document, wherein the first portion and the at least one additional 
portion collectively comprise the entire document; 

reconstructing the entire document from the first portion and the at least one additional 
portion; and 

processing the entire document by the web service host, wherein said processing 
comprises at least one of: 

extracting text from said entire document to configure said text in a text format, if 
said entire document received by said web service host comprises said text in a non-text 
format; 

generating document keys associated with said text from analysis of said text in 
said text format, if said entire document received by said web service host comprises said 
text in said text format, or if said web service host has previously performed said 
extracting such that said text in said text format is available to said web service host; and 

determining, from given categories of a document taxonomy, a set of closest 
categories to the document, if said entire document received by said web service host 
comprises said document keys, or if said web service host has previously performed said 
generating such that said document keys are available to said web service host. 
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20. (Original) The method of claim 19, wherein the web services host is listed in a Universal 
Description, Discovery, and Integration (UDDI) registry as being able to receive said document 
in chunks and being able to perform said at least one of said extracting, generating, and 
determining. 

21. (Original) The method of claim 19, wherein said receiving and sequentially receiving steps 
comprise receiving the first portion and the at least one additional portion via Internet 
transmission from said remote host. 

22. (Original) The method of claim 19, wherein said generating comprises: 

generating tokens of said text such that stop words do not appear in said tokens; and 
stemming said tokens to generate said document keys from said tokens. 

23. (Original) The method of claim 19, wherein said processing comprises said extracting, said 
generating, and said determining. 

24. (Original) The method of claim 19, wherein said processing consists of two of said 
extracting, said generating, and said determining. 

25. (Original) The method of claim 19, wherein said processing comprises said extracting but not 
said generating and not said determining. 
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26. (Original) The method of claim 19, wherein said processing comprises said generating but 
not said extracting and not said determining. 

27. (Original) The method of claim 19, wherein said processing comprises said determining but 
not said extracting and not said generating. 

28. (Original) The method of claim 19, wherein said determining comprises: 

comparing the category keys of each category with said document keys to make a 
determination of a distance between the document and each category as a measure of how close 
the document is to each category; and 

determining said set of closest categories based on said determination. 

29. (Original) The method of claim 19, wherein said processing comprises said determining, and 
wherein the method further comprises: 

creating a search string, said search string comprising a logical function of a subset of 
said document keys; 

submitting said search string to a search engine; 

receiving links to related documents from said search engine, said links being based on 
said search string; and 

returning said links to said remote host. 
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30. (Previously presented) A system for document analysis and retrieval, comprising a second 
computing system that includes a web service host, wherein the web service host is remote 
relative to a remote host in a first computing system, and wherein the web service host is adapted 
to: 

receive a first portion of a document from the remote host; 

sequentially receive at least one additional portion of the document from the remote host, 
wherein the first portion and the at least one additional portion collectively comprise the entire 
document; 

reconstruct the entire document from the first portion and the at least one additional 
portion; and 

implement processing the entire document, said processing comprising at least one of: 

extracting text from said entire document to configure said text in a text format, if 
said entire document received by said web service host comprises said text in a non-text 
format; 

generating document keys associated with said text from analysis of said text in 
said text format, if said entire document received by said web service host comprises said 
text in said text format, or if said web service host has previously performed said 
extracting such that said text in said text format is available to said web service host; and 

determining, from given categories of a document taxonomy, a set of closest 
categories to the document, if said entire document received by said web service host 
comprises said document keys, or if said web service host has previously performed said 
generating such that said document keys are available to said web service host. 
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31. (Original) The system of claim 30, wherein the web services host is listed in a Universal 
Description, Discovery, and Integration (UDDI) registry as being able to receive said document 
in chunks and being able to perform said at least one of said extracting, generating, and 
determining. 

32. (Original) The system of claim 30, wherein to receive and sequentially receive comprise to 
receive the first portion and the at least one additional portion via Internet transmission from said 
remote host. 

33. (Original) The system of claim 30, wherein said generating comprises: 

generating tokens of said text such that stop words do not appear in said tokens; and 
stemming said tokens to generate said document keys from said tokens. 

34. (Original) The system of claim 30, wherein said processing comprises said extracting, said 
generating, and said determining. 

35. (Original) The system of claim 30, wherein said processing consists of two of said extracting, 
said generating, and said determining. 

36. (Original) The system of claim 30, wherein said processing comprises said extracting but not 
said generating and not said determining. 
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37. (Original) The system of claim 30 5 wherein said processing comprises said generating but not 
said extracting and not said determining. 

38. (Original) The system of claim 30, wherein said processing comprises said determining but 
not said extracting and not said generating. 

39. (Original) The system of claim 30, wherein said determining comprises: 

comparing the category keys of each category with said document keys to make a 
determination of a distance between the document and each category as a measure of how close 
the document is to each category; and 

determining said set of closest categories based on said determination. 

40. (Original) The system of claim 30, wherein said processing comprises said determining, and 
wherein the method further comprises: 

creating a search string, said search string comprising a logical function of a subset of 
said document keys; 

submitting said search string to a search engine; 

receiving links to related documents from said search engine, said links being based on 
said search string; and 

returning said links to said remote host. 



41. (Previously presented) The system of claim 1, wherein said determining comprises: 
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comparing the category keys of each category with said document keys to make a 
determination of a distance between the document and each category as a measure of how close 
the document is to each category; and 

determining said set of closest categories based on said determination. 

42. (Previously presented) The method of claim 41, wherein said comparing comprises 
computing said distance for each category as a dot product of a vector of the document keys and 
a vector of the category keys of each category. 

43. (Previously presented) The system of claim 10, wherein said determining comprises: 

comparing the category keys of each category with said document keys to make a 
determination of a distance between the document and each category as a measure of how close 
the document is to each category; and 

determining said set of closest categories based on said determination. 

44. (Previously presented) The system of claim 43, wherein said comparing comprises computing 
said distance for each category as a dot product of a vector of the document keys and a vector of 
the category keys of each category. 

45. (Previously presented) The method of claim 28, wherein said comparing comprises 
computing said distance for each category as a dot product of a vector of the document keys and 
a vector of the category keys of each category. 
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46. (Previously presented) The system of claim 39, wherein said comparing comprises computing 
said distance for each category as a dot product of a vector of the document keys and a vector of 
the category keys of each category. 

47. (Previously presented) The method of claim 1, wherein said sequentially transmitting at least 
one additional portion of the document is performed after said transmitting the first portion of the 
document has been performed. 

48. (Previously presented) The method of claim 10, wherein the remote host is adapted to 
sequentially transmit the at least one additional portion of the document to the web service host 
after the remote host has transmitted the first portion of the document to the web service host. 

49. (Previously presented) The method of claim 19, wherein said sequentially receiving at least 
one additional portion of the document is performed after said receiving the first portion of the 
document has been performed. 

50. (Previously presented) The method of claim 30, wherein the web service host is adapted to 
sequentially receive the at least one additional portion of the document from the remote host after 
the web service host has received the first portion of the document from the remote host. 
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